Return-Path: <connolly@pixel.convex.com>
Received: from dxmint.cern.ch by nxoc01.cern.ch (NeXT-1.0 (From Sendmail 5.52)/NeXT-2.0)
id AA05767; Thu, 10 Dec 92 17:52:06 MET
Received: by dxmint.cern.ch (5.65/DEC-Ultrix/4.3)
id AA02503; Thu, 10 Dec 1992 18:05:32 +0100
Received: from pixel.convex.com by convex.convex.com (5.64/1.35)
id AA13092; Thu, 10 Dec 92 11:05:26 -0600
Received: from localhost by pixel.convex.com (5.64/1.28)
id AA23223; Thu, 10 Dec 92 11:05:24 -0600
Message-Id: <9212101705.AA23223@pixel.convex.com>
To: www-talk@nxoc01.cern.ch
Subject: <, >, and & characters in HTML
Date: Thu, 10 Dec 92 11:05:24 CST
From: Dan Connolly <connolly@pixel.convex.com>
I'm trying to write libHTML so that applications
can just deal with C style character strings, and
the library does all the SGML details.
The use of &lt; and &amp; to represent < and & never
seemed to fit cleanly into the SGML view of things.
So I posted to comp.text.sgml.
I think I'm a lot clearer on the matter now. The
&lt; and &gt; entities are meant to be used in typesetting
mathematics, where a less-than symbol is not necessarily
the same thing as a '<' character.
There's a mechanism for referencing characters in the
document character set in such a way that they will
not be treated as markup: numeric character references.
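As a rough illustration of that mechanism, here is a minimal C sketch of a routine that turns an ordinary C string into SGML data by writing '<', '>', and '&' as numeric character references (&#60;, &#62;, &#38;). The name sgml_escape and its signature are hypothetical, not part of libHTML or any existing library, and the sketch assumes an ASCII-compatible document character set.

```c
#include <stddef.h>
#include <stdio.h>

/* Hypothetical helper (an assumption for illustration, not a real libHTML call).
 * Copies src into dst, replacing '<', '>' and '&' with numeric character
 * references based on their ASCII codes (60, 62, 38).
 * Returns the number of bytes written, not counting the terminating NUL,
 * or -1 if dst is too small. */
long sgml_escape(const char *src, char *dst, size_t dstlen)
{
    size_t out = 0;

    for (; *src != '\0'; src++) {
        unsigned char c = (unsigned char)*src;

        if (c == '<' || c == '>' || c == '&') {
            /* "&#NN;" takes 5 bytes; keep 1 more for the final NUL */
            if (dstlen - out < 6)
                return -1;
            out += (size_t)sprintf(dst + out, "&#%d;", (int)c);
        } else {
            /* 1 byte for the character, 1 for the final NUL */
            if (dstlen - out < 2)
                return -1;
            dst[out++] = (char)c;
        }
    }

    if (out >= dstlen)      /* only possible when dstlen is 0 */
        return -1;
    dst[out] = '\0';
    return (long)out;
}
```

With this sketch, an application would pass a plain string such as "a<b & c>d" and get back data that an SGML parser should not mistake for markup; going the other way (resolving the references) is left to the parser.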
I'd like to get rid of the &lt;, &gt;, and &amp; entities
from the HTML DTD. Granted, there will be a transition
period while providers adjust, but I think it will make
the spec cleaner.
Anyway, here's what the experts had to say...
[edited...]
Newsgroups: comp.text.sgml
From: Erik Naggum <SGML@ifi.uio.no>
Reply-To: Erik Naggum <enag@ifi.uio.no>
Message-ID: <19921210.008@erik.naggum.no>
Date: 10 Dec 1992 07:36:57 +0100
References: <1992Dec10.020628.4884@news.eng.convex.com>
Subject: Re: hiding <, >, and &
Lines: 111
[Dan Connolly]
:
| There is a lot of need for a routine that represents an arbitrary string
| of characters as SGML data -- a routine that hides <tags> etc. from
| the parser.
The simplest would be to use character references for the characters
that you need to quote. See 9.5 Character Reference, [357:10-13].
...
The way I see it, this is a completely failsafe technique.
| [It does bring up the question of representing " and ' characters in
| attribute value literals. Hmm.. another situation I think I'll
| just avoid.]
What's wrong with "'", '"', "&#34;", and '&#39;'?
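The four forms above suggest a simple rule for writing attribute value literals: pick whichever delimiter does not occur in the value, and fall back to a numeric character reference when both occur. A minimal C sketch of that rule follows. The name sgml_quote_attr is hypothetical, and escaping '&' as &#38; is an added assumption so that the value cannot be read as the start of a reference.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (an assumption for illustration, not a real API).
 * Writes val into dst as a quoted attribute value literal.
 * Returns the number of bytes written, not counting the terminating NUL,
 * or -1 if dst is too small. */
long sgml_quote_attr(const char *val, char *dst, size_t dstlen)
{
    /* Use '...' only when the value contains a '"' but no '\'';
     * otherwise use "..." and escape any '"' inside it. */
    char delim = (strchr(val, '"') != NULL && strchr(val, '\'') == NULL)
                     ? '\'' : '"';
    size_t out = 0;

    if (dstlen < 3)             /* two delimiters plus NUL */
        return -1;
    dst[out++] = delim;

    for (; *val != '\0'; val++) {
        const char *rep = NULL;
        size_t n;

        if (*val == '&')
            rep = "&#38;";
        else if (*val == delim) /* can only happen when delim is '"' */
            rep = "&#34;";

        n = rep ? strlen(rep) : 1;
        /* keep room for the closing delimiter and the NUL */
        if (dstlen - out < n + 2)
            return -1;
        if (rep)
            memcpy(dst + out, rep, n);
        else
            dst[out] = *val;
        out += n;
    }

    dst[out++] = delim;
    dst[out] = '\0';
    return (long)out;
}
```

Preferring an unused delimiter keeps most values readable in the source; the numeric reference is only needed for values that contain both kinds of quote.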
...
</Erik>
--
Erik Naggum ISO 8879 SGML +47 295 0313
ISO 10744 HyTime
<erik@naggum.no> ISO 9899 C Memento, terrigena
<enag@ifi.uio.no> ISO 10646 UCS Memento, vita brevis